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ISOLATED GENOMIC POLYNUCLEOTIDE FRAGMENTS FROM s 
CHROMOSOME 12 THAT ENCODE HUMAN CARBOXYPEPTIDASE M AND 
5 THE HUMAN MOUSE DOUBLE MINUTE 2 HOMOLOG 

FIELD OF THE INVENTION 

The invention is directed to isolated genomic polynucleotide fragments from the human 
chromosome 12ql3-ql5 region that particularly encode human carboxypeptidase M and human 
10 mouse double minute 2 homolog, vectors and hosts containing these fragments and fragments 
hybridizing to noncoding regions as well as their reverse complements t. The invention is further 
directed to methods of using these fragments to obtain human carboxypeptidase M and human 
mouse double minute 2 homolog and to diagnose, treat, prevent and/or ameliorate a pathological 
disorder. 

15 

BACKGROUND OF THE INVENTION 

Chromosome 12ql3 - ql5 contains genes encoding, for example, interleukin 22, a protein 
tyrosine phosphatase, interferon-gamma, carboxypeptidase M and the human mouse double minute 
2 homolog; the last two of which are discussed in more detail below. The chromosome 12ql3 - 
20 ql5 region is known to be aberrant in tumors such as sarcomas (Oliner et al., Nature 358: 80-3, 
1992). 

HUMAN CARBOXYPEPTIDASE M 

Human carboxypeptidase M is a cell membrane-bound basic carboxypeptidase believed to act by 
25 activating, inactivating and modulating excitatory peptides such as the anaphylatoxins and kinins 
(Tan et al., J. Biol. Chem. 264: 13165-70. 1989). Its expression is increased as monocytes 
differentiate into macrophages (Rehli et al., J. Biol. Chem. 270: 15644-9, 1995). It is also widely 
distributed as an ectoenzyme of specialized epithelia and endothelia. Its ability to convert 
anaphylatoxins to their less active C-terminal des-Arg forms 
30 protects against complement-linked tissue damage. 

HUMAN MOUSE DOUBLE MINUTE 2 HOMOLOG 

Human mouse double minute 2 homolog plays a key role in modulating actions of p53 (Oliner et 
al., supra), in part by targeting p53 for destruction (Ries et al., Cell 103: 321-30, 2000). Over- 
35 expression of this oncogene increases tumorigenic potential. The human mouse double minute 2 
homolog is over-expressed in both sarcomas and some leukemias. In addition to its ability to in 



effect neutralize p53, it reacts also with a retinoblastoma protein. 



SUMMARY OF THE INVENTION 

The invention is directed to isolated genomic polynucleotides, said polynucleotides obtainable from 
5 the human chromosome 12ql3-ql5 region having a nucleotide sequence at least 95% identical to a 

sequence selected from the group consisting of: 

(a) a genomic polynucleotide encoding a polypeptide selected from the group consisting of 

human carboxypeptidase M depicted in SEQ ID NO: 1 or human mouse double minute 2 homolog 

depicted in SEQ ED NO:2, or variants of SEQ ID NOS: 1 or 2; 
10 (b) a genomic polynucleotide selected from the group consisting of SEQ ID NO:3 

which encodes human carboxypeptidase M depicted in SEQ ID NO: 1 and SEQ ED NO:4 which 

encodes human mouse double minute 2 homolog depicted in SEQ ID NO:2, or variants of SEQ ID 

NOS: 3 or 4, 
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35 
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15 (c) a polynucleotide which hybridizes to any one of the polynucleotides specified in (a)-(b) 

and 

(d) a polynucleotide that is a reverse complement to the polynucleotides specified in (a) to 

(c) 

as well as nucleic acid constructs, expression vectors and host cells containing these polynucleotide 
20 sequences. 

The invention further relates to a polynucleotide comprising: 

(a) a genomic double stranded polynucleotide set forth in SEQ ID NO: 3 encoding human 
carboxypeptidase M set forth in SEQ ED NO: 1 and the polynucleotide set forth in SEQ ID NO:4 

25 encoding human mouse double minute 2 homolog set forth in SEQ ID NO:2; 

(b) a polynucleotide that hybridizes to one strand of the polynucleotide of (a) and 

(c) a reverse complement of (a) and (b). 

as well as nucleic acid constructs, expression vectors and host cells containing these polynucleotide 
sequences. 



The polynucleotides of the present invention may be used for the manufacture of a gene therapy for 
the prevention, treatment or amelioration of a medical condition by adding an amount of a 
composition comprising said polynucleotide effective to prevent, treat or ameliorate said medical 
condition. 

The invention is further directed to obtaining these polypeptides by 



2 



(a) culturing host cells comprising these sequences under conditions that provide for the 
expression of said polypeptide and 

(b) recovering said expressed polypeptide. 

The polypeptides obtained may be used to produce antibodies by 
5 (a) optionally conjugating said polypeptide to a carrier protein; 

(b) immunizing a host animal with said polypeptide or peptide-carrier protein conjugate of 
step (a) with an adjuvant and 

(c) obtaining antibody from said immunized host animal. 

10 The invention is further directed to a nucleic acid molecule or reverse complement thereof 

comprising a sequence of nucleotides which specifically hybridizes to noncoding regions of said 
polynucleotide sequences of SEQ ID NO:3 (human carboxypeptidase M gene) or SEQ ID NO:4 
(human mouse double minute 2 homolog gene). These sequences may be used to modulate levels 
of human carboxypeptidase M and human mouse double minute 2 homolog in a subject in need 

15 thereof and specifically for the manufacture of a medicament for prevention, treatment or 

amelioration of a medical condition. As defined herein, a "polynucleotide fragment" may be a 
nucleic acid molecule including DNA, RNA and analogs thereof including protein nucleic acids and 
mixtures thereof and may include a probe and primer. Such molecules are generally of a length 
such that they are statistically unique in the genome of interest. Generally, for a probe or primer to 

20 be unique in the human genome, it contains at least 14 to 16 contiguous nucleotides of a sequence 
complementary to or identical to a target sequence of interest. These polynucleotide fragments can 
be 20, 30, 50, 100, 150, 500, 600, 1000, 2000 or more nucleic acids long. Probes and primers 
may also be referred to as oligonucleotides. As defined herein, a "reverse complement" is a 
molecule encoding a sequence complementary to at least a portion of an RNA molecule or to a 

25 genomic DNA segment and may be used interchangeably with "antisense oligonucleotide". The 
sequence is sufficiently complementary to be able to hybridize with the RNA or DNA, preferably 
under moderate or high stringency conditions to form a stable duplex or triplex. A "reverse 
complement" also includes peptide nucleic acid reverse complement sequences. 

30 The invention is further directed to kits comprising these polynucleotides and kits comprising these 
sequences. In a specific embodiment, the sequence(s) are attached to a substrate. In a specific 
embodiment, the support is a microarray. The microarray may contain a plurality of sequences 
hybridizing to non-coding sequences. As defined herein, a "plurality" of sequences is two or 
more sequences. Alternatively, the microarray comprises non-coding sequences as well as coding 

35 sequences. 
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In a specific embodiment, the noncoding regions are transcription regulatory regions. The 
transcription regulatory regions may be used to produce a heterologous peptide by expressing in a 
host cell, said transcription regulatory region operably linked to a polynucleotide encoding the 
heterologous polypeptide and recovering the expressed heterologous polypeptide. 

5 

The polynucleotides of the present invention may be used to detect a pathological condition or 
susceptibility to a pathological condition in a subject comprising 

(a) isolating genomic DNA from said subject; 

(b) detecting the presence or absence of a variant in said genomic DNA using a probe or 
10 primer derived from a polynucleotide hybridizing to non-coding region(s) of a human 

carboxy peptidase M gene and human mouse double minute 2 homolog gene; and 

(c) diagnosing a pathological condition or susceptibility to a pathological condition based 
on the presence or absence of said variant. 

15 Probes or primers derived from SEQ ID NO:3 (human carboxypeptidase M gene) or SEQ ID NO: 
4 (human mouse double minute 2 homolog gene) may be used to identify variants including but not 
limited to mutations, duplications, translocations, polysomies and mosaicism on the human 
carboxypeptidase M gene or on the human mouse double minute 2 homolog. Therefore, the 
invention is also directed to a method for identifying variants of SEQ ID NO:3 and 4 comprising 

20 (a) isolating genomic DNA from a subject and 

(b) determining the presence or absence of a variant in said genomic DNA using the probes or 
primers. 

DETAILED DESCRIPTION OF THE INVENTION 

25 The invention is directed to isolated genomic polynucleotide fragments that encode human 

carboxypeptidase M and human mouse double minute 2 homolog, which in a specific embodiment 
are the human carboxypeptidase M and human mouse double minute 2 homolog genes, as well as 
vectors and hosts containing these fragments and polynucleotide fragments hybridizing to 
noncoding regions, as well as antisense oligonucleotides to these fragments. 

30 

As defined herein, a "gene" is the segment of DNA involved in producing a polypeptide chain; it 
includes regions preceding and following the coding region, as well as intervening sequences 
(introns) between individual coding segments (exons). 

35 As defined herein "isolated" refers to material removed from its original environment and is thus 
altered "by the hand of man" from its natural state. An isolated polynucleotide can be part of a 

4 



vector, a composition of matter or can be contained within a cell as long as the cell is not the 
original environment of the polynucleotide. 

The polynucleotides of the present invention may be in the form of RNA or in the form of DNA, 
5 which DNA includes genomic DNA and synthetic DNA. The DNA may be double-stranded or 
single-stranded and if single stranded may be the coding strand or non-coding strand. 

The genes encoding human carboxypeptidase M and the human mouse double minute 2 homolog 
are disposed in the chromosome 12 genomic clone of accession number AC025423, 150579 base 
10 pairs, at, respectively, nucleotides 1 - 99860 and 99541 - 150579. 

The polynucleotides of the invention have at least a 95% identity and may have a 96%, 97%, 98% 
or 99% identity to the polynucleotides depicted in SEQ ID NOS:3 or 4 as well as the 
polynucleotides in reverse sense orientation, or the polynucleotide sequences encoding the human 
15 carboxypeptidase M or human mouse double minute 2 homolog polypeptides depicted in SEQ ID 
NOS:l or 2 respectively. 

A polynucleotide having 95% "identity" to a reference nucleotide sequence of the present 
invention, is identical to the reference sequence except that the polynucleotide sequence may 

20 include, on average, up to five point mutations per each 100 nucleotides of the reference nucleotide 
sequence encoding the polypeptide. In other words, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in 
the reference sequence may be deleted or substituted with another nucleotide, or a number of 
nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the 

25 reference sequence. The query sequence may be an entire sequence, the ORF (open reading 
frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 90%, 
95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention can be 

30 determined conventionally using known computer programs. A preferred method for determining 
the best overall match between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using the FASTDB 
computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. (1990) 6:237- 
245). In a sequence alignment the query and subject sequences are both DNA sequences. An 

35 RNA sequence can be compared by converting U's to T's. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA 
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sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, 
Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size 
Penalty=0.05, Window Size=500 or the length of the subject nucleotide sequence, whichever is 
shorter. 

5 

If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, not 
because of internal deletions, a manual correction must be made to the results. This is because the 
FASTDB program does not account for 5' and 3' truncations of the subject sequence when 
calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the 

10 query sequence, the percent identity is corrected by calculating the number of bases of the query 
sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identify, calculated by the above FASTDB program using the specified parameters, to arrive at a 

15 final percent identity score. This corrected score is what is used for the purposes of the present 
invention. Only bases outside the 5' and 3' bases of the subject sequence, as displayed by the 
FASTDB alignment, which are not matched/aligned with the query sequence are calculated for the 
purposes of manually adjusting the percent identity score. 

20 For example, a 90 base subject sequence is aligned to a 100 base query sequence to determine 
percent identity. The deletions occur at the 5' end of the subject sequence and therefore, the 
FASTDB alignment does not show a matched/alignment of the first 10 bases at 5' end. The 10 
unpaired bases represent 10% of the sequence (number of bases at the 5' and 3' ends not 
matched/total numbers of bases in the query sequence) so 10% is subtracted from the percent 

25 identity score calculated by the FASTDB program. If the remaining 90 bases were perfectly 

matched the final percent identity would be 90%. In another example, a 90 base subject sequence 
is compared with a 100 base query sequence. This time the deletions are internal deletions so that 
there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned with the 
query. In this case the percent identity calculated by FASTDB is not manually corrected. Once 

30 again, only bases 5' and 3' of the subject sequence which are not matched/aligned with the query 
sequence are manually corrected for. No other manual corrections are made for purposes of the 
present invention. 

A polypeptide that has an amino acid sequence at least, for example, 95% "identical" to a query 
35 amino acid sequence is identical to the query sequence except that the subject polypeptide sequence 
may include on average, up to five amino acid alterations per each 100 amino acids of the query 
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amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 
5 reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the referenced sequence or in one or more contiguous groups 
within the reference sequence. 

A preferred method for determining the best overall match between a query sequence (a sequence 
10 of the present invention) and a subject sequence, also referred to as a global sequence alignment, 
can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. 
(Com. App. Biosci. (1990) 6:237-245). In a sequence alignment, the query and subject sequence 
are either both nucleotide sequences or both amino acid sequences. The result of said global 
sequence alignment is in percent identity. Preferred parameters used in a FASTDB amino acid 
15 alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penal ty=20, 
Randomization Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap 
Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid 
sequence, whichever is shorter. 

20 If the subject sequence is shorter than the query sequence due to N- or C- terminal deletions, not 
because of internal deletions, a manual correction must be made to the results. This is because the 
FASTDB program does not account for N- and C- terminal truncations of the subject sequence 
when calculating global percent identity. For subject sequences truncated at the N- and C-termini, 
relative to the query sequence, the percent identity is corrected by calculating the number of 

25 residues of the query sequence that are N- and C-terminal of the subject sequence, which are not 
matched/aligned with a corresponding subject residue, as a percent of the total bases of the query 
sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence 
alignment. This percentage is then subtracted from the percent identity, calculated by the above 
FASTDB program using the specified parameters, to arrive at a final percent identity score. This 

30 final percent identity score is what is used for the purposes of the present invention. Only residues 
to the N- and C-termini of the subject sequence, which are not matched/aligned with the query 
sequence, are considered for the purposes of manually adjusting the percent identity score. That is, 
only query residue positions outside the farthest N- and C-terminal residues of the subject 
sequence. 

35 

The invention also encompasses polynucleotides that hybridize to the polynucleotides depicted in 
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SEQ ID NOS: 3 or 4. A polynucleotide "hybridizes" to another polynucleotide, when a single- 
stranded form of the polynucleotide can anneal to the other polynucleotide under 
the appropriate conditions of temperature and solution ionic strength (see Sambrook et al., supra). 
The conditions of temperature and ionic strength determine the "stringency" of the hybridization. 
5 For preliminary screening for homologous nucleic acids, low stringency hybridization conditions, 

corresponding to a temperature of 42°C, can be used, e.g., 5X SSC, 0.1% SDS, 0.25% milk, and 

no formamide; or 40% formamide, 5X SSC, 0.5% SDS). Moderate stringency hybridization 

conditions correspond to a higher temperature of 55°C, e.g., 40% formamide, with 5X or 6X 

SCC. High stringency hybridization conditions correspond to the highest temperature of 65°C, 

10 e.g., 50 % formamide, 5X or 6X SCC. Hybridization requires that the two nucleic acids contain 
complementary sequences, although depending on the stringency of the hybridization, mismatches 
between bases are possible. The appropriate stringency for hybridizing nucleic acids depends on 
the length of the nucleic acids and the degree of complementation, variables well known in the art. 
The greater the degree of similarity or homology between two nucleotide sequences, the greater the 

15 value of T m for hybrids of nucleic acids having those sequences. The relative stability 

(corresponding to higher T m ) of nucleic acid hybridizations decreases in the following order: 
RNArRNA, DNA:RNA, DNA:DNA. 

Polynucleotide and polypeptide variants 

20 The invention is directed to both polynucleotide and polypeptide variants. A "variant" refers to a 
polynucleotide or polypeptide differing from the polynucleotide or polypeptide of the present 
invention, but retaining essential properties thereof. Generally, variants are overall closely similar 
and in many regions, identical to the polynucleotide or polypeptide of the present invention. 

25 The variants may contain alterations in the coding regions, non-coding regions, or both. 
Especially preferred are polynucleotide variants containing alterations which produce silent 
substitutions, additions, or deletions, but do not alter the properties or activities of the encoded 
polypeptide. Nucleotide variants produced by silent substitutions due to the degeneracy of the 
genetic code are preferred. Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are 

30 substituted, deleted, or added in any combination are also preferred. 

The term "variant" also encompasses allelic variants of said polynucleotides. An allelic variant 
denotes any of two or more alternative forms of a gene occupying the same chromosomal locus. 
Allelic variation arises naturally through mutation, and may result in polymorphism within 
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populations. Gene mutations can be silent (no change in the encoded polypeptide) or may encode 
polypeptides having altered amino acid sequences. An allelic variant of a polypeptide is a 
polypeptide encoded by an allelic variant of a gene. 

5 The term "variant" also encompasses naturally occurring variants such as single nucleotide 
polymorphisms (SNPs). 

The amino acid sequences of the variant polypeptides may differ from the amino acid sequences 
depicted in SEQ ID NOS: 1 or 2 by an insertion or deletion of one or more amino acid residues 

10 and/or the substitution of one or more amino acid residues by different amino acid residues. 

Preferably, amino acid changes are of a minor nature, that is conservative amino acid substitutions 
that do not significantly affect the folding and/or activity of the protein; small deletions, typically of 
one to about 30 amino acids; small amino- or carboxyl-terminal extensions, such as an amino- 
terminal methionine residue; a small linker peptide of up to about 20-25 residues; or a small 

1 5 extension that facilitates purification by changing net charge or another function, such as a poly- 
histidine tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of basic amino acids (arginine, lysine 
and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine 

20 and asparagine), hydrophobic amino acids (leucine, isoleucine and valine), aromatic amino acids 
(phenylalanine, tryptophan and tyrosine), and small amino acids (glycine, alanine, serine, 
threonine and methionine). Amino acid substitutions which do not generally alter the specific 
activity are known in the art and are described, for example, by H. Neurath and R.L. Hill, 1979, 
In, The Proteins, Academic Press, New York. The most commonly occurring exchanges are 

25 Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, 
Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, as well as these in reverse. 

Noncoding Regions 

The invention is further directed to polynucleotide fragments containing or hybridizing to 
30 noncoding regions of the human carboxypeptidase M or human mouse double minute 2 homolog 
genes. These include but are not limited to an expression control element, an intron, a 5'- non- 
coding region, a 3'- non-coding region and splice junctions (see Tables 1-2, as well as 
transcription factor binding sites (see Table 3). The polynucleotide fragments may be a short 
polynucleotide fragment which is between about 20 nucleotides to about 50 nucleotides in length. 
35 Such shorter fragments may be useful for diagnostic purposes. Such short polynucleotide 
fragments are also preferred with respect to polynucleotides containing or hybridizing to 
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polynucleotides containing splice junctions. Alternatively larger fragments, e.g., of about 50, 150, 
500, 600, 2000 or about 5000 nucleotides in length may be used. 



TABLE 1. EXON/INTRON ORGANIZATION OF THE HUMAN CARBOXY- 
5 PEPTIDASE M GENE (cDNA ACCESSION NO. XM_006768) IN SEQ ID NO: 3, 
99680 BASE PAIRS; NUCLEOTIDES 1 - 99680 IN THE GENOMIC CLONE OF 
ACCESSION NO. AC025423 (FORWARD STRAND CODING). 



EXON NUCLEOTIDE NO. AMINO ACID NO. 

10 



1 


16641 - 


16796 


1-52 


2 


63585 - 


63686 


53 - 86 


3 


77522 - 


77692 


87 - 143 


4 


79077 - 


79262 


144 - 205 


5 


79982 - 


80152 


206 - 262 


6 


82429 - 


82581 


263 - 313 


7 


90406 - 


90555 


314 - 363 


8 


92799 - 


93038 


364 - 443 


STOP CODON 


93039 - 


93041 





20 



TABLE 2. EXON/INTRON ORGANIZATION OF THE HUMAN MOUSE 
DOUBLE MINUTE 2 HOMOLOG GENE (VARIANT OF ACCESSION NO. 
NM_002392) IN SEQ ID NO:4, 51039 BASE PAIRS; NUCLEOTIDES 99541 - 
25 150579 IN THE GENOMIC CLONE OF ACCESSION NO. AC025423 
(REVERSE STRAND CODING). 



EXON NUCLEOTIDE NO. AMINO ACID NO. 



30 


STOP CODON 


10089 




10091 






10 


10092 




10664 


491 - 301 




9 


13189 




13266 


300 - 275 




8 


13954 




14109 


274 - 223 


35 


7 


21007 




21168 


222 - 169 




6 


25288 




25383 


168 - 137 




5 


25508 




25576 


136 - 114 




4 


29565 




29615 


113 - 97 




3 


32995 




33126 


96 - 53 


40 


2 


36310 




36384 


52 - 28 




1 


40646 




40726 


27 - 1 



TABLE 3: TRANSCRIPTION FACTOR BINDING SITES ON GENES THAT 
ENCODE CARBOXYPEPTIDASE M (CpM) AND THE HUMAN HOMOLOG OF 
45 MOUSE DOUBLE MINUTE 2 (huMDM2) 

BINDING SITES CpM huMDM2 

AP1FJ_Q2 60 25 
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API C 


16 


11 




API Q2 


39 


13 




API Q4 


24 


12 




AP4 Q5 


47 


27 


5 


AP4 Q6 


22 


14 




ARNT 01 




4 




BRN2 01 


29 


6 




CAAT 01 


7 


4 




CDPCR3HD 01 


19 


7 


10 


CEBPB_01 


26 


6 




CMYB 01 


7 






CREL 01 


15 


4 




DELTAEF1 01 


196 


98 




FREAC7 01 


30 


29 


15 


GATA1 02 


40 


25 




GATA1_03 


63 


21 




GATA1 04 


109 


46 




GATA1_05 


21 


13 


20 


GATA1 06 


33 


26 




GATA2 02 


59 


35 




GATA2 03 


20 


19 




GATA3 02 


30 


23 




GATA3 03 


18 


20 


25 


GATA_C 


61 


15 




GFII 01 


23 


8 




HFH2 01 


20 


13 




HFH3 01 


32 


13 


30 


HFH8 01 


23 


7 




HNF3B 01 


10 


7 




IK1 01 


12 






IK2 01 


216 


63 




LM02COM 01 


86 


23 


35 


LM02COM 02 


85 


23 




LYF1.01 


45 


41 




1\yf A V A"! 

MAX 01 


o 
8 


4 




MYCMAX 02 


8 






MYOD 01 


5 




40 


MYOD Q6 


49 


21 




MZF1 01 


187 


61 




NF1_Q6 


10 


5 




NFAT Q6 


134 


71 


45 


NFY Q6 


16 






NKX25 01 


48 


35 




NKX25 02 


30 


9 




NMYC 01 


16 


10 




OCT1 01 


3 




50 


OCT1 02 


6 






OCT1 06 


3 






OCT1 07 


5 






OCT1 Q6 


5 






RORA1 01 


8 


9 



11 



S8 01 


183 


128 


SOX5_01 


76 


29 


SRY 02 


38 


27 


STAT 01 


11 




TATA 01 


28 


22 


TATA C 


20 


8 


TCF11 01 


182 


51 


USF 01 


16 


10 


USF C 


16 


10 


VMYB 02 


7 


11 


XFD2 01 


11 


8 
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In a specific embodiment, such noncoding sequences are expression control sequences. These 
include but are not limited to DNA regulatory sequences, such as promoters, enhancers, 
repressors, terminators, and the like, that provide for the regulation of expression of a coding 
sequence in a host cell. In eukaryotic cells, polyadenylation signals are also control sequences. 

20 

In a more specific embodiment of the invention, the expression control sequences may be 
operatively linked to a polynucleotide encoding a heterologous polypeptide. Such expression 
control sequences may be about 50-200 nucleotides in length and specifically about 50, 100, 200, 
500, 600, 1000 or 2000 nucleotides in length. A transcriptional control sequence is "operatively 

25 linked" to a polynucleotide encoding a heterologous polypeptide sequence when the expression 
control sequence controls and regulates the transcription and translation of that polynucleotide 
sequence. The term "operatively linked" includes having an appropriate start signal (e.g., ATG) in 
front of the polynucleotide sequence to be expressed and maintaining the correct reading frame to 
permit expression of the DNA sequence under the control of the expression control sequence and 

30 production of the desired product encoded by the polynucleotide sequence. If a gene that one 
desires to insert into a recombinant DNA molecule does not contain an appropriate start signal, 
such a start signal can be inserted upstream (5') of and in reading frame with the gene. 

The invention is further directed to antisense oligonucleotides and mimetics to these polynucleotide 
35 sequences. Antisense technology can be used to control gene expression through triple-helix 
formation or antisense DNA or RNA, both of which methods are based on binding of a 
polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature polypeptides of the present invention, is used to design an 
antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide 
40 is designed to be complementary to a region of the gene involved in transcription or RNA 
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processing (triple helix (see Lee et al., Nucl. Acids Res., 6:3073 (1979); Cooney et al, Science, 
241:456 (1988); and Dervan et al., Science, 251: 1360 (1991)), thereby preventing transcription 
and the production of said polypeptides. 

5 Expression of Polypeptides 

Isolated Polynucleotide Sequences 

The human chromosome 12 genomic clone of accession number AC025423 has been discovered to 
contain the human carboxypeptidase M gene and the human mouse double minute 2 homolog gene 
10 by Genscan analysis (Burge et al., 1997, J. Mol. Biol. 268:78-94), BLAST2 and TBLASTN 
analysis (Altschul et al., 1997, Nucl. Acids Res. 25:3389-3402), in which the sequence of 
AC025423 was compared to the human carboxypeptidase M cDNA sequence, accession number 
XM_006768 and the human mouse double minute 2 homolog cDNA sequence accession number 
NM_002392, one of several splice variants. 

15 

The cloning of the nucleic acid sequences of the present invention from such genomic DNA can be 
effected, e.g., by using the well known polymerase chain reaction (PCR) or antibody screening of 
expression libraries to detect cloned DNA fragments with shared structural features. See, e.g., 
Innis et al, 1990, PCR: A Guide to Methods and Application, Academic Press, New York. Other 

20 nucleic acid amplification procedures such as ligase chain reaction (LCR), ligated activated 

transcription (LAT) and nucleic acid sequence-based amplification (NASB A) or long range PCR 
may be used. In a specific embodiment, 5'- or 3'- non-coding portions of each gene may be 
identified by methods including but are not limited to, filter probing, clone enrichment using 
specific probes and protocols similar or identical to 5'- and 3'- "RACE" protocols which are well 

25 known in the art. For instance, a method similar to 5'- RACE is available for generating the 

missing 5'- end of a desired full-length transcript. (Fromont-Racine et al., 1993, Nucl. Acids Res. 
21:1683-1684). 

Once the DNA fragments are generated, identification of the specific DNA fragment containing the 
30 desired human carboxypeptidase M gene or the human mouse double minute 2 homolog gene may 
be accomplished in a number of ways. For example, if an amount of a portion of a human 
carboxypeptidase M gene or the human mouse double minute 2 homolog gene or its specific RNA, 
or a fragment thereof, is available and can be purified and labeled, the generated DNA fragments 
may be screened by nucleic acid hybridization to the labeled probe (Benton and Davis, 1977, 
35 Science 196:180; Grunstein and Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A.72:3961). The 
present invention provides such nucleic acid probes, which can be conveniently prepared from the 



13 



specific sequences disclosed herein, e.g., a hybridizable probe having a nucleotide sequence 
corresponding to at least a 15 and preferably 40, nucleotide fragment of the sequences depicted in 
SEQ ED NOS:3 or 4. Preferably, a fragment is selected that is highly unique to the polypeptides of 
the invention. Those DNA fragments with substantial homology to the probe will hybridize. As 

5 noted above, the greater the degree of homology, the more stringent hybridization conditions can 
be used. In one embodiment, low stringency hybridization conditions are used to identify a 
homologous human carboxypeptidase M or human mouse double minute 2 homolog 
polynucleotide. However, in a preferred aspect, and as demonstrated experimentally herein, a 
nucleic acid encoding a polypeptide of the invention will hybridize to a nucleic acid derived from 

10 the polynucleotide sequence depicted in SEQ ID NOS:3 or 4 or a hybridizable fragment thereof, 
under moderately stringent conditions; more preferably, it will hybridize under high stringency 
conditions. 

Alternatively, the presence of the gene may be detected by assays based on the physical, chemical, 
15 or immunological properties of its expressed product. For example, cDNA clones, or DNA clones 
which hybrid-select the proper mRNAs, can be selected which produce a protein that, e.g., has 
similar or identical electrophoretic migration, isoelectric focusing behavior, proteolytic digestion 
maps, or antigenic properties as known for the human carboxypeptidase M or human mouse 
double minute 2 homolog polypeptide. 

20 

A gene encoding human carboxypeptidase M or human mouse double minute 2 homolog 
polypeptide can also be identified by mRNA selection, i.e., by nucleic acid hybridization followed 
by in vitro translation. In this procedure, fragments are used to isolate complementary mRNAs by 
hybridization. Immunoprecipitation analysis or functional assays of the in vitro translation 
25 products of the products of the isolated mRNAs identifies the mRNA and, therefore, the 
complementary DNA fragments, that contain the desired sequences. 

Nucleic Acid Constructs 

The present invention also relates to nucleic acid constructs comprising a polynucleotide sequence 
30 containing the exon/intron segments of the human carboxypeptidase M gene (nucleotides 1 - 99680 
of SEQ ID NO:3) or human mouse double minute 2 homolog gene (nucleotides 1 - 51039 of SEQ 
ID NO:4) operably linked to one or more control sequences which direct the expression of the 
coding sequence in a suitable host cell under conditions compatible with the control sequences. 
Expression will be understood to include any step involved in the production of the polypeptide 
35 including, but not limited to, transcription, post-transcriptional modification, translation, post- 
radiational modification, and secretion. 
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The invention is further directed to a nucleic acid construct comprising expression control 
sequences derived from SEQ ID NOS: 3 or 4 and a heterologous polynucleotide sequence. 

5 "Nucleic acid construct" is defined herein as a nucleic acid molecule, either single- or double- 
stranded, which is isolated from a naturally occurring gene or which has been modified to contain 
segments of nucleic acid which are combined and juxtaposed in a manner which would not 
otherwise exist in nature. The term nucleic acid construct is synonymous with the term expression 
cassette when the nucleic acid construct contains all the control sequences required for expression 

10 of a coding sequence of the present invention. The term "coding sequence" is defined herein as a 
portion of a nucleic acid sequence which directly specifies the amino acid sequence of its protein 
product. The boundaries of the coding sequence are generally determined by a ribosome binding 
site (prokaryotes) or by the ATG start codon (eukaryotes) located just upstream of the open reading 
frame at the 5'- end of the mRNA and a transcription terminator sequence located just downstream 

15 of the open reading frame at the 3'- end of the mRNA. A coding sequence can include, but is not 
limited to, DNA, cDNA, and recombinant nucleic acid sequences. 

The isolated polynucleotide of the present invention may be manipulated in a variety of ways to 
provide for expression of the polypeptide. Manipulation of the nucleic acid sequence prior to its 
20 insertion into a vector may be desirable or necessary depending on the expression vector. The 
techniques for modifying nucleic acid sequences utilizing recombinant DNA methods are well 
known in the art. 

The control sequence may be an appropriate promoter sequence, a nucleic acid sequence which is 
25 recognized by a host cell for expression of the nucleic acid sequence. The promoter sequence 
contains transcriptional control sequences that regulate the expression of the polynucleotide. The 
promoter may be any nucleic acid sequence that shows transcriptional activity in the host cell of 
choice including mutant, truncated, and hybrid promoters, and may be obtained from genes 
encoding extracellular or intracellular polypeptides either homologous or heterologous to the host 
30 cell. 

Examples of suitable promoters for directing the transcription of the nucleic acid constructs of the 
present invention, especially in a bacterial host cell, are the promoters obtained from the £. coli lac 
operon, the Streptomyces coelicolor agarase gene (dagA), the Bacillus subtilis levansucrase gene 
35 (sacB), the Bacillus licheniformis alpha-amylase gene (amyL), the Bacillus stearothermophilus 

maltogenic amylase gene (amyM), the Bacillus amyloliquefaciens alpha-amylase gene (amyQ), the 
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Bacillus licheniformis penicillinase gene (penP), the Bacillus subtilis xylA and xylB genes, and the 
prokaryotic beta-lactamase gene (Villa-Komaroff et ai, 1978, Proceedings of the National 
Academy of Sciences USA 75: 3727-3731), as well as the tac promoter (DeBoer etal, 1983, 
Proceedings of the National Academy of Sciences USA 80: 21-25). Further promoters are 
5 described in "Useful proteins from recombinant bacteria" in Scientific American, 1980, 242: 74- 
94; and in Sambrook etal., 1989, supra. 

I 

Examples of suitable promoters for directing the transcription of the nucleic acid constructs of the 
present invention in a filamentous fungal host cell are promoters obtained from the genes encoding 

10 Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Aspergillus niger 
neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus niger or 
Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus oryzae alkaline 
protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans acetamidase, 
Fusarium oxysporum trypsin-like protease (WO 96/00787), NA2-tpi (a hybrid of the promoters 

15 from the genes encoding Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose 
phosphate isomerase), and mutant, truncated, and hybrid promoters thereof. 

In a yeast host, useful promoters are obtained from the Saccharomyces cerevisiae enolase (ENO-1) 
gene, the Saccharomyces cerevisiae galactokinase gene (GAL1), the Saccharomyces cerevisiae 
20 alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase genes (ADH2/GAP), and the 
Saccharomyces cerevisiae 3-phosphoglycerate kinase gene. Other useful promoters for yeast host 
cells are described by Romanos et ai, 1992, Yeast 8: 423-488. 

The control sequence may also be a suitable transcription terminator sequence, a sequence 
25 recognized by a host cell to terminate transcription. The terminator sequence is operably linked to 

the 3'- terminus of the nucleic acid sequence encoding the polypeptide. Any terminator which is 

functional in the host cell of choice may be used in the present invention. 

Preferred terminators for filamentous fungal host cells are obtained from the genes encoding 

Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans 
30 anthranilate synthase, Aspergillus niger alpha-glucosidase, and Fusarium oxysporum trypsin-like 

protease. 

Preferred terminators for yeast host cells are obtained from the genes encoding Saccharomyces 
cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), or Saccharomyces cerevisiae 
35 glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast host cells are 
described by Romanos et ai, 1992, supra. 



16 



The control sequence may also be a suitable leader sequence, a nontranslated region of an mRNA 
which is important for translation by the host cell. The leader sequence is operably linked to the 
5'- terminus of the nucleic acid sequence encoding the polypeptide. Any leader sequence that is 
5 functional in the host cell of choice may be used in the present invention. 

Preferred leaders for filamentous fungal host cells are obtained from the genes encoding 
Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase. 

10 Suitable leaders for yeast host cells are obtained from the Saccharomyces cerevisiae enolase (ENO- 
1) gene, the Saccharomyces cerevisiae 3-phosphoglycerate kinase gene, the Saccharomyces 
cerevisiae alpha-factor, and the Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde- 
3-phosphate dehydrogenase genes (ADH2/GAP). 

15 The control sequence may also be a polyadenylation sequence, a sequence which is operably linked 
to the 3'- terminus of the nucleic acid sequence and which, when transcribed, is recognized by the 
host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation 
sequence which is functional in the host cell of choice may be used in the present invention. 

20 Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes 
encoding Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus 
nidulans anthranilate synthase, and Aspergillus niger alpha-glucosidase. 

Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 1995, 
25 Molecular Cellular Biology 15: 5983-5990. 

The control sequence may also be a signal peptide coding region, which codes for an amino acid 
sequence linked to the amino terminus of the polypeptide which can direct the encoded polypeptide 
into the cell's secretory pathway. The 5'- end of the coding sequence of the nucleic acid sequence 

30 may inherently contain a signal peptide coding region naturally linked in translation reading frame 
with the segment of the coding region which encodes the secreted polypeptide. Alternatively, the 
5'- end of the coding sequence may contain a signal peptide coding region which is foreign to the 
coding sequence. The foreign signal peptide coding region may be required where the coding 
sequence does not normally contain a signal peptide coding region. Alternatively, the foreign 

35 signal peptide coding region'may simply replace the natural signal peptide coding region in order to 
obtain enhanced secretion of the polypeptide. However, any signal peptide coding region which 
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directs the expressed polypeptide into the secretory pathway of a host cell of choice may be used in 
the present invention. 



An effective signal peptide coding region for bacterial host cells is the signal peptide coding region 
5 obtained from the maltogenic amylase gene from Bacillus NCBB 1 1837, the Bacillus 

stearothermophilus alpha-amylase gene, the Bacillus licheniformis subtilisin gene, the Bacillus 
licheniformis beta-lactamase gene, the Bacillus stearothermophilus neutral proteases genes {nprT, 
nprS, nprM), or the Bacillus subtilis prsA gene. Further signal peptides are described by Simonen 
andPalva, 1993, Microbiological Reviews 57: 109-137. 

10 

An effective signal peptide coding region for filamentous fungal host cells is the signal peptide 
coding region obtained from the Aspergillus oryzae TAKA amylase gene, Aspergillus niger neutral 
amylase gene, Aspergillus niger glucoamylase gene, Rhizomucor miehei aspartic proteinase gene, 
Humicola lanuginosa cellulase gene, or Humicola lanuginosa lipase gene. V/ 

15 

Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces 
cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding 
regions are described by Romanos et aL, 1992, supra. 

20 The control sequence may also be a propeptide coding region, which codes for an amino acid 

sequence positioned at the amino terminus of a polypeptide. The resultant polypeptide is known as 
a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally 
inactive and can be converted to a mature active polypeptide by catalytic or autocatalytic cleavage of 
the propeptide from the propolypeptide. The propeptide coding region may be obtained from the 

25 Bacillus subtilis alkaline protease gene (aprE), the Bacillus subtilis neutral protease gene (nprT), 
the Saccharomyces cerevisiae alpha-factor gene, the Rhizomucor miehei aspartic proteinase gene, 
or the Myceliophthora thermophila laccase gene (WO 95/33836). 

Where both signal peptide and propeptide regions are present at the amino terminus of a 
30 polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide and 
the signal peptide region is positioned next to the amino terminus of the propeptide region. 

It may also be desirable to add regulatory sequences that allow the regulation of the expression of 
the polypeptide relative to the growth of the host cell. Examples of regulatory systems are those 
35 which cause the expression of the gene to be turned on or off in response to a chemical or physical 
stimulus, including the presence of a regulatory compound. Regulatory systems in prokaryotic 
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systems would include the lac, tac, and trp operator systems. In yeast, the ADH2 system or GAL1 
system may be used. In filamentous fungi, the TAKA alpha-amylase promoter, Aspergillus niger 
glucoamylase promoter, and the Aspergillus oryzae glucoamylase promoter may be used as 
regulatory sequences. Other examples of regulatory sequences are those which allow for gene 
5 amplification. In eukaryotic systems, these include the dihydrofolate reductase gene which is 
amplified in the presence of methotrexate, and the metallothionein genes which are amplified with 
heavy metals. In these cases, the nucleic acid sequence encoding the polypeptide would be 
operably linked with the regulatory sequence. 

10 Expression Vectors 

The present invention also relates to recombinant expression vectors comprising a nucleic acid 
sequence of the present invention, a promoter, and transcriptional and translational stop signals. 
The various nucleic acid and control sequences described above may be joined together to produce 
a recombinant expression vector which may include one or more convenient restriction sites to 

15 allow for insertion or substitution of the nucleic acid sequence encoding the polypeptide at such 
sites. Alternatively, the polynucleotide of the present invention may be expressed by inserting the 
nucleic acid sequence or a nucleic acid construct comprising the sequence into an appropriate vector 
for expression. In creating the expression vector, the coding sequence is located in the vector so 
that the coding sequence is operably linked with the appropriate control sequences for expression. 

20 

The recombinant expression vector may be any vector {e.g., a plasmid or virus) which can be 
conveniently subjected to recombinant DNA procedures and can bring about the expression of the 
nucleic acid sequence. The choice of the vector will typically depend on the compatibility of the 
vector with the host cell into which the vector is to be introduced. The vectors may be linear or 
25 closed circular plasmids. 

The vector may be an autonomously replicating vector, i.e., a vector which exists as an 
extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., 
a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. The 
30 vector may contain any means for assuring self-replication. Alternatively, the vector may be one 
which, when introduced into the host cell, is integrated into the genome and replicated together 
with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid 
or two or more vectors or plasmids which together contain the total DNA to be introduced into the 
genome of the host cell, or a transposon may be used. 

35 

The vectors of the present invention preferably contain one or more selectable markers which 

i 
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permit easy selection of transformed cells. A selectable marker is a gene the product of which 
provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and 
the like. Examples of bacterial selectable markers are the dal genes from Bacillus subtilis or 
Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, 
5 kanamycin, chloramphenicol or tetracycline resistance. Suitable markers for yeast host cells are 
ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. A selectable marker for use in a 
filamentous fungal host cell may be selected from the group including, but not limited to, amdS 
(acetamidase), argB (ornithine carbamoyltransferase), bar (phosphinothricin acety transferase), 
hygB (hygromycin phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-50-phosphate 
10 decarboxylase), sC (sulfate adenyltransf erase), trpC (anthranilate synthase), as well as equivalents 
from other species. Preferred for use in an Aspergillus cell are the amdS and pyrG genes of 
Aspergillus nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus. 

The vectors of the present invention preferably contain an element(s) that permits stable integration 
15 of the vector into the host cell genome or autonomous replication of the vector in the cell 
independent of the genome of the cell. 

For integration into the host cell genome, the vector may rely on the polynucleotide sequence 
encoding the polypeptide or any other element of the vector for stable integration of the vector into 

20 the genome by homologous or nonhomologous recombination. Alternatively, the vector may 
contain additional nucleic acid sequences for directing integration by homologous recombination 
into the genome of the host cell. The additional polynucleotide sequences enable the vector to be 
integrated into the host cell genome at a precise location(s) in the chromosome(s). To increase the 
likelihood of integration at a precise location, the integrational elements should preferably contain a 

25 sufficient number of nucleic acids, such as 100 to 1,500 base pairs, preferably 400 to 1,500 base 
pairs, and most preferably 800 to 1,500 base pairs, which are highly homologous with the 
corresponding target sequence to enhance the probability of homologous recombination. The 
integrational elements may be any sequence that is homologous with the target sequence in the 
genome of the host cell. Furthermore, the integrational elements may be non-encoding or encoding 

30 nucleic acid sequences. On the other hand, the vector may be integrated into the genome of the 
host cell by non-homologous recombination. 

For autonomous replication, the vector may further comprise an origin of replication enabling the 
vector to replicate autonomously in the host cell in question. Examples of bacterial origins of 
35 replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and 
pACYC184 permitting replication in E. coli, and pUBl 10, pE194, pTA1060, and pAM§l 
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permitting replication in Bacillus. Examples of origins of replication for use in a yeast host cell are 
the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the 
combination of ARS4 and CEN6. The origin of replication may be one having a mutation which 
makes its functioning temperature-sensitive in the host cell (see, e.g., Ehrlich, 1978, Proceedings 
5 of the National Academy of Sciences USA 75: 1433). 

More than one copy of a polynucleotide sequence of the present invention may be inserted into the 
host cell to increase production of the gene product. An increase in the copy number of the 
polynucleotide sequence can be obtained by integrating at least one additional copy of the sequence 
10 into the host cell genome or by including an amplifiable selectable marker gene with the nucleic 
acid sequence where cells containing amplified copies of the selectable marker gene, and thereby 
additional copies of the nucleic acid sequence, can be selected for by cultivating the cells in the 
presence of the appropriate selectable agent. 

1 5 The procedures used to ligate the elements described above to construct the recombinant expression 
vectors of the present invention are well known to one skilled in the art (see, e.g., Sambrook et ah, 
1989, supra). 

Host Cells 

20 The present invention also relates to recombinant host cells, comprising a nucleic acid sequence of 
the invention, which are advantageously used in the recombinant production of the polypeptides. 
A vector comprising a nucleic acid sequence of the present invention is introduced into a host cell 
so that the vector is maintained as a chromosomal integrant or as a self-replicating extra- 
chromosomal vector as described earlier. The term "host cell" encompasses any progeny of a 

25 parent cell that is not identical to the parent cell due to mutations that occur during replication. The 
choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its 
source. 

The host cell may be a unicellular microorganism, e.g., a prokaryote, or a non-unicellular 
30 microorganism, e.g., a eukaryote. Useful unicellular cells are bacterial cells such as gram positive 
bacteria including, but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus 
amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, 
Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus 
stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis\ or a Streptomyces cell, e.g., 
35 Streptomyces lividans or Streptomyces murinus, or gram negative bacteria such as E. coli and 
Pseudomonas sp. In a preferred embodiment, the bacterial host cell is a Bacillus lentus, Bacillus 
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licheniformis, Bacillus stearothermophilus or Bacillus subtilis cell. In another preferred 
embodiment, the Bacillus cell is an alkalophilic Bacillus. 

The introduction of a vector into a bacterial host cell may, for instance, be effected by protoplast 
5 transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168: 1 11-1 15), 
using competent cells (see, e.g., Young and Spizizin, 1961, Journal of Bacteriology 81: 823-829, 
or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56: 209-221), 
electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or 
conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5771-5278). 

10 s 

The host cell may be a eukaryote, such as a mammalian cell (e.g., human cell), an insect cell, a 
plant cell or a fungal cell. Mammalian host cells that could be used include but are not limited to 
human Hela, 293, H9 and Jurkat cells, mouse NIH3t3 and C127 cells, Cos 1, Cos 7 and CV1, 
quail QC1-3 cells, mouse L cells and Chinese Hamster ovary (CHO) cells. These cells may be 
15 transfected with a vector containing a transcriptional regulatory sequence, a protein coding 
sequence and transcriptional termination sequences. Alternatively, the polypeptide can be 
expressed in stable cell lines containing the polynucleotide integrated into a chromosome. The co- 
transfection with a selectable marker such as dhfr, gpt, neomycin, hygromycin allows the 
identification and isolation of the transfected cells. 

20 

The host cell may be a fungal cell. "Fungi" as used herein includes the phyla Ascomycota, 
Basidiomycota, Chytridiomycota, and Zygomycota (as defined by Hawksworth et al, In, 
Ainsworth andBisby's Dictionary of The Fungi, 8th edition, 1995, CAB International, University 
Press, Cambridge, UK) as well as the Oomycota (as cited in Hawksworth et al, 1995, supra, 

25 page 171) and all mitosporic fungi (Hawksworth et al, 1995, supra). The fungal host cell may 
also be a yeast cell. "Yeast" as used herein includes ascosporogenous yeast (Endomyce tales), 
basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the 
classification of yeast may change in the future, for the purposes of this invention, yeast shall be 
defined as described in Biology and Activities of Yeast (Skinner, F.A., Passmore, S.M., and 

30 Davenport, R.R., eds, Soc. App. Bacterioi Symposium Series No. 9, 1980). The fungal host 
cell may also be a filamentous fungal cell. "Filamentous fungi" include all filamentous forms of 
the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The 
filamentous fungi are characterized by a mycelial wall composed of chitin, cellulose, glucan, 
chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation 

35 and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as 
Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be 
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fermentative. 

Fungal cells may be transformed by a process involving protoplast formation, transformation of 
the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable procedures 
5 for transformation of Aspergillus host cells are described in EP 238 023 and Yelton et al.\ 1984, 
Proceedings of the National Academy of Sciences USA 81 : 1470-1474. Suitable methods for 
transforming Fusarium species are described by Malardier etal, 1989, Gene 78: 147-156 and WO 
96/00787. Yeast may be transformed using the procedures described by Becker and Guarente, In 
Abelson, J.N. and Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biology, 
10 Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et aL, 
1983, Journal of Bacteriology 153: 163; and Hinnen et aL, 1978, Proceedings of the National 
Academy of Sciences USA 75: 1920. 

Methods of Production 

15 The present invention also relates to methods for producing a polypeptide of the present invention 
comprising (a) cultivating a host cell under conditions conducive for production of the polypeptide; 
and (b) recovering the polypeptide. 

In the production methods of the present invention, the cells are cultivated in a nutrient medium 

20 suitable for production of the polypeptide using methods known in the art. For example, the cell 
may be cultivated by shake flask cultivation, small-scale or large-scale fermentation (including 
continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors 
performed in a suitable medium and under conditions allowing the polypeptide to be expressed 
and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and 

25 nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are 
available from commercial suppliers or may be prepared according to published compositions 
(e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into 
the nutrient medium, the polypeptide can be recovered directly from the medium. If the 

* polypeptide is not secreted, it can be recovered from cell lysates. 

30 

The polypeptides may be detected using methods known in the art that are specific for the 
polypeptides. These detection methods may include use of specific antibodies, formation of an 
enzyme product, or disappearance of an enzyme substrate. In a specific embodiment, an enzyme 
assay may be used to determine the activity of the polypeptide. For example, carboxypeptidase M 
35 activity can be determined by measuring the release of the C-terminal arginine of bradykinin or a 
synthetic acyl-dipeptide such as benzoyl-Ala-Arg. The human homolog of mouse double minute 2 
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may be detected by its ability to bind p53. 

The resulting polypeptide may be recovered by methods known in the art. For example, the 
polypeptide may be recovered from the nutrient medium by conventional procedures including, but 
5 not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. 

The polypeptides of the present invention may be purified by a variety of procedures known in the 
art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, 
chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric 
10 focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, or extraction 
(see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publishers, New 
York, 1989). 

Antibodies 

15 According to the invention, the human carboxypeptidase M or human mouse double minute 2 
homolog polypeptides produced according to the method of the present invention may be used as 
an immunogen to generate any of these antibodies. Such antibodies include but are not limited to 
polyclonal, monoclonal, chimeric, single chain, Fab fragments, and a Fab expression library. 

20 Various procedures known in the art may be used for the production of antibodies. For the 

production of antibody, various host animals can be immunized by injection with the polypeptide 
thereof, including but not limited to rabbits, mice, rats, sheep, goats, etc. In one embodiment, the 
polypeptide or fragment thereof can optionally be conjugated to an immunogenic carrier, e.g., 
bovine serum albumin (BS A) or keyhole limpet hemocyanin (KLH). Various adjuvants may be 

25 used to increase the immunological response, depending on the host species, including but not 
limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum. 

30 

For preparation of monoclonal antibodies directed toward the human carboxypeptidase M or 
human mouse double minute 2 homolog polypeptide, any technique that provides for the 
production of antibody molecules by continuous cell lines in culture may be used. These include 
but are not limited to the hybridoma technique originally developed by Kohler and Milstein (1975, 
35 Nature 256:495-497), as well as the trioma technique, the human B-cell hybridoma technique 
(Kozbor et al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to produce 
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human monoclonal antibodies (Cole et al., 1985, in Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the invention, monoclonal 
antibodies can be produced in germ-free animals utilizing recent technology (PCT/US90/02545). 
According to the invention, human antibodies may be used and can be obtained by using human 
5 hybridomas (Cote et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) or by transforming 
human B cells with EBV virus in vitro (Cole et al., 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, pp. 77-96). In fact, according to the invention, techniques developed for 
the production of "chimeric antibodies" (Morrison et al., 1984, J. Bacteriol. 159-870; Neuberger et 
al., 1984, Nature 312:604-608; Takeda et al., 1985, Nature 314:452-454) by splicing the genes 
10 from a mouse antibody molecule specific for the human carboxypeptidase M or human mouse 
double minute 2 homolog polypeptide together with genes from a human antibody molecule of 
appropriate biological activity can be used; such antibodies are within the scope of this invention. 

According to the invention, techniques described for the production of single chain antibodies 
15 (U.S. Pat. No. 4,946,778) can be adapted to produce polypeptide-specific single chain antibodies. 
An additional embodiment of the invention utilizes the techniques described for the construction of 
Fab expression libraries (Huse et al., 1989, Science 246:1275-1281) to allow 
rapid and easy identification of monoclonal Fab fragments with the desired specificity for the 
human carboxypeptidase M or human mouse double minute 2 homolog polypeptide. 

20 

Antibody fragments that contain the idiotype of the antibody molecule can be generated by known 
techniques. For example, such fragments include but are not limited to: the F(ab')2 fragment which 
can be produced by pepsin digestion of the antibody molecule; the Fab' fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2, fragment, 
25 and the Fab fragments which can be generated by treating the antibody molecule with papain and a 
reducing agent. 

In the production of antibodies, screening for the desired antibody can be accomplished by 
techniques known in the art, e.g., radioimmunoassay, ELISA (enzyme-linked immunosorbent 

30 assay), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope 
labels, for example), western blots, precipitation reactions, agglutination assays (e.g., gel 
agglutination assays, hemagglutination assays), complement fixation assays, immunofluorescence 
assays, protein A assays, and immunoelectrophoresis assays, etc. In one embodiment, antibody 

35 binding is detected by detecting a label on the primary antibody. In another embodiment, the 

primary antibody is detected by detecting binding of a secondary antibody or reagent to the primary 
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antibody. In a further embodiment, the secondary antibody is labeled. Many means are known in 
the art for detecting binding in an immunoassay and are within the scope of the present invention. 
For example, to select antibodies which recognize a specific epitope of a particular polypeptide, 
one may assay generated hybridomas for a product which binds to a particular polypeptide 
5 fragment containing such epitope. For selection of an antibody specific to a particular polypeptide 
from a particular species of animal, one can select on the basis of positive binding with the 
polypeptide expressed by or isolated from cells of that species of animal. 

Immortal, antibody-producing cell lines can also be created by techniques other than fusion, such 
10 as direct transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein-Barr 
virus. See, e.g.,JVL Schreier et al., "Hybridoma Techniques" (1980); Hammerling et al., 
"Monoclonal Antibodies And T-cell Hybridomas" (1981); Kennett et al., "Monoclonal Antibodies" 
(1980); see also U.S. Pat. Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,451,570; 
4,466,917; 4,472,500; 4,491,632; 4,493,890. 

15 

Substrate 

In a specific embodiment, the polynucleotides of the present invention, particularly, the 
polynucleotide fragments for hybridizing to non-coding regions of SEQ ID NOS: 3 or 4 may be 
attached to a substrate or reverse complements of said fragments. A substrate may be solid or 
20 porous, planar or non-planar, unitary or distributed. The polynucleotide may be attached 

covalently or applied to a derivatized surface in a chaotropic agent that facilitates denaturation 
and adherence by presumed noncovalent interactions, or some combinations thereof. 

In a more specific embodiment, the substrate is a microarray. "Microarray" as defined herein is a 
25 substrate-bound collection of a plurality nucleic acids, hybridization to each of the plurality of 
bound nucleic acids being separately detectable. The microarray may comprise a plurality of 
polynucleotides hybridizing to a non coding region of SEQ ID NO:3 or 4. Alternatively the 
microarray may comprise a polynucleotide(s) hybridizing to said non-coding region and/or coding 
regions of SEQ ID NO:3 or 4. 

30 

Uses of Polynucleotides 

Diagnostics 

Polynucleotide fragments containing noncoding regions of SEQ ID NO: 3 or 4 may be used as 
35 probes for detecting variants from genomic nucleotide samples from a patient. The variants may 
be allelic variants or substitution, insertion or deletion nucleotide variants. Genomic DNA may 
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be isolated from the patient. Alternatively the polynucleotide fragments may be used to monitor 
expression of SEQ ID NO: 3 or 4 from samples from a patient. A mutation(s) may be detected 
by Southern blot analysis, for example, by hybridizing restriction digested genomic DNA to 
various probes between 10-500 nucleotides in length, preferably between 20-200 nucleotides in 
5 length, more preferably between 20-100 nucleotides in length and most preferably between 20-50 
nucleotides in length and subjecting to agarose electrophoresis. Alternatively, these 
polynucleotides may be used as PCR primers between about 10-100 nucleotides in length and be 
used to amplify the genomic DNA isolated from the patients. Methods for performing primer- 
directed amplification (routine or long range PCR) are well known in the art (see, for example, 

10 PCR Basics: From Background to Bench, Springer Verlag (2000); Gelfand et al., (eds.), PCR 
Strategies, Academic Press (1998)). Single base extension (see, for example, U.S. Patent No. 
6,004,744) may be used to detect SNPs. Additionally, primers may be obtained by routine or 
long range PCR that yield products containing contiguous intron(s)/exon sequence(s) and 
products containing more than one exon with intervening intron(s). The sequence of the amplified 

1 5 genomic DNA from the patient may be determined using methods known in the art. Such probes 
may be between 20-5000 nucleotides in length and may preferably be between 20-50 nucleotides 
in length. 

Thus the invention is directed to kits comprising these polynucleotide probes. In a specific 
20 embodiment, these probes are labeled with a detectable substance. 

In one embodiment, the probes are in solution. In another embodiment, the probes are attached 
to a substrate. In a specific embodiment, the probes are contained within a microarray and are 
separately detectable, 

25 

The probes or primers of the present invention could be used to identify patients with or having 
a propensity for sepsis (SEQ ID NO: 3 -carboxy peptidase M gene) or for sarcoma or leukemias 
(SEQ ID NO:4-human mouse double minute 2 homolog gene). 

30 Antisense Oligonucleotides and Mimetics 

The antisense or reverse complement oligonucleotides or mimetics of the present invention may be 
used to decrease levels of a polypeptide. For example, human carboxypeptidase M has been found 
to form des-Arg9-bradykinin, an agonist of the Bl receptor activated by sepsis. Therefore, the 
human carboxypeptidase M antisense oligonucleotides of the present invention could be used to 

35 inhibit formation of des-Arg9-bradykinin. Human mouse double minute 2 homolog antisense 
sequences may be used to treat sarcomas and leukemias in which the gene is over-expressed. 
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The antisense oligonucleotides of the present invention may be formulated into pharmaceutical 
compositions. These compositions may be administered in a number of ways depending upon 
whether local or systemic treatment is desired and upon the area to be treated. Administration may 
5 be topical (including ophthalmic and to mucous membranes including vaginal and rectal 

delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by 
nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral 
administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular 
injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration. 

10 

Pharmaceutical compositions and formulations for topical administration may include transdermal 
patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. 
Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may 
be necessary or desirable. 

15 

Compositions and formulations for oral administration include powders or granules, suspensions 
or solutions in water or non-aqueous media, capsules, sachets or tablets. Thickeners, flavoring 
agents, diluents, emulsifiers,dispersing aids or binders may be desirable. 

20 Compositions and formulations for parenteral, intrathecal or intraventricular administration may 
include sterile aqueous solutions which may also contain buffers, diluents and other suitable 
additives such as, but not limited to, penetration enhancers, carrier compounds and other 
pharmaceutically acceptable carriers or excipients. 

25 Pharmaceutical compositions of the present invention include, but are not limited to, solutions, 
emulsions, and liposome-containing formulations. These compositions may be generated from a 
variety of components that include, but are not limited to, preformed liquids, self-emulsifying 
solids and self-emulsifying semisolids. 

30 The pharmaceutical formulations of the present invention, which may conveniently be presented in 
unit dosage form, may be prepared according to conventional techniques well known in the 
pharmaceutical industry. Such techniques include the step of bringing into association the active 
ingredients with the pharmaceutical carrier(s) or excipient(s). In general, the formulations are 
prepared by uniformly and intimately bringing into association the active ingredients with liquid 

35 carriers or finely divided solid carriers or both, and then, if necessary, shaping the product. 
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The compositions of the present invention may be formulated into any of many possible dosage 
forms such as, but not limited to, tablets, capsules, liquid syrups, soft gels, suppositories, and 
enemas. The compositions of the present invention may also be formulated as suspensions in 
aqueous, non-aqueous or mixed media. Aqueous suspensions may further contain substances 
5 which increase the viscosity of the suspension including, for example, sodium 

carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers. 

In one embodiment of the present invention, the pharmaceutical compositions may be formulated 
and used as foams. Pharmaceutical foams include formulations such as, but not limited to, 
10 emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these 
formulations vary in the components and the consistency of the final product. The preparation of 
such compositions and formulations is generally known to those skilled in the pharmaceutical and 
formulation arts and may be applied to the formulation of the compositions of the present 
invention. 

15 

The formulation of therapeutic compositions and their subsequent administration is believed to be 
within the skill of those in the art. Dosing is dependent on severity and responsiveness of the 
disease state to be treated, with the course of treatment lasting from several days to 
several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal 
20 dosing schedules can be calculated from measurements of drug accumulation in the body of the 
patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies 
and repetition rates. Optimum dosages may vary depending on the relative potency of individual 
oligonucleotides, and can generally be estimated based on EC50 as found to be effective in in vitro 
and in vivo animal models. 

25 

In general, dosage is from 0.01 ug to 10 g per kg of body weight, and may be given once or more 
daily, weekly, monthly or yearly, or even once every 2 to 20 years. Persons of ordinary skill in the 
art can easily estimate repetition rates for dosing based on measured residence times and 
concentrations of the drug in bodily fluids or tissues. Following successful treatment, it may be 
30 desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease 
state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 ug to 
10 g per kg of body weight, once or more daily, to once every 20 years. 

Gene Therapy 

35 As noted above, human carboxypeptidase M modulates actions of anaphylatoxins and kinins and 
human mouse double minute 2 homolog plays a role in cell proliferation. Therefore, the human 
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carboxypeptidase M gene may be used to modulate or prevent complement-linked tissue damage, 
in subjects in need thereof, for example, those exhibiting allergic reactions to a given substance. 
The human mouse double minute 2 homolog gene may be used to stimulate cell proliferation in 
subjects in need thereof, for example, for wound healing and those suffering from 
5 neurodegenerative or neuromuscular diseases, ischemic stroke, anoxia, ischemia/reperfusion 
damage and intoxication septic shock.. 

As described herein, the polynucleotide of the present invention may be introduced into a patient's 
cells for therapeutic uses. As will be discussed in further detail below, cells can be transfected 

10 using any appropriate means, including viral vectors, as shown by the example, chemical 

transfectants, or physico-mechanical methods such as electroporation and direct diffusion of DNA. 
See, for example, Wolff, Jon A, et al., "Direct gene transfer into mouse muscle in vivo," Science, 
247, 1465-1468, 1990; and Wolff, Jon A, "Human dystrophin expression in mdx mice after 
intramuscular injection of DNA constructs," Nature, 352, 815-818, 1991. As used herein, vectors 

1 5 are agents that transport the gene into the cell without degradation and include a promoter yielding 
expression of the gene in the cells into which it is delivered. As will be discussed in further detail 
below, promoters can be general promoters, yielding expression in a variety of mammalian cells, 
or cell specific, or even nuclear versus cytoplasmic specific. These are known to those skilled in 
the art and can be constructed using standard molecular biology protocols. Vectors have been 

20 divided into two classes: 

a) Biological agents derived from viral, bacterial or other sources. 

b) Chemical physical methods that increase the potential for gene uptake, directly introduce the 
25 gene into the nucleus or target the gene to a cell receptor. 

Biological Vectors 

Viral vectors have higher transaction (ability to introduce genes) abilities than do most chemical or 
physical methods to introduce genes into cells. Vectors that may be used in the present invention 

30 include viruses, such as adenoviruses, adeno associated virus (AAV), vaccinia, herpesviruses, 
baculoviruses and retroviruses, bacteriophages, cosmids, plasmids, fungal vectors and other 
recombination vehicles typically used in the art which have been described for expression in a 
variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple 
protein expression. Polynucleotides are inserted into vector genomes using methods well known in 

35 the art. 
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Retroviral vectors are the vectors most commonly used in clinical trials, since they carry a larger 
genetic payload than other viral vectors. However, they are not useful in non-proliferating cells. 
Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be 
delivered in aerosol formulation. Pox viral vectors are large and have several sites for inserting 
5 genes, they are thermostable and can be stored at room temperature. 

Examples of promoters are SP6, T4, T7, SV40 early promoter, cytomegalovirus (CMV) promoter, 
mouse mammary tumor virus (MMTV) steroid-inducible promoter, Moloney murine leukemia 
virus (MMLV) promoter, phosphoglycerate kinase (PGK) promoter, and the like. Alternatively, 

10 the promoter may be an endogenous adenovirus promoter, for example the El a promoter or the 
Ad2 major late promoter (MLP). Similarly, those of ordinary skill in the art can construct 
adenoviral vectors utilizing endogenous or heterologous poly A addition signals. 
Plasmids are not integrated into the genome and the vast majority of them are present only from a 
few weeks to several months, so they are typically very safe. However, they have lower 

15 expression levels than retroviruses and since cells have the ability to identify and eventually shut 
down foreign gene expression, the continuous release of DNA from the polymer to the target cells 
substantially increases the duration of functional expression while maintaining the benefit of the 
safety associated with non-viral transfections. 

20 Chemical/physical vectors 

Other methods to directly introduce genes into cells or exploit receptors on the surface of cells 
include the use of liposomes and lipids, ligands for specific cell surface receptors, cell receptors, 
and calcium phosphate and other chemical mediators, microinjections directly to single cells, 
electroporation and homologous recombination. Liposomes are commercially available from Gibco 

25 BRL, for example, as LIPOFECTIN* and LIPOFECTACE , which are formed of cationic lipids 
such as N-[l-(2,3 dioleyloxy)-propy l]-n,n,n-trimethylammonium chloride (DOTMA) and dimethyl 
dioctadecylammonium bromide (DDAB). Numerous methods are also published for making 
liposomes, known to those skilled in the art. 

30 For example, Nucleic acid-Lipid Complexes— Lipid carriers can be associated with naked nucleic 
acids (e.g., plasmid DNA) to facilitate passage through cellular membranes. Cationic, anionic, or 
neutral lipids can be used for this purpose. However, cationic lipids are preferred because they 
have been shown to associate better with DNA which, generally, has a negative charge. Cationic 
lipids have also been shown to mediate intracellular delivery of plasmid DNA (Feigner and 

35 Ringold, Nature 337:387 (1989)). Intravenous injection of cationic lipid-plasmid complexes into 
mice has been shown to result in expression of the DNA in lung (Brigham et ah, Am. J. Med. 
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Sci.298:278 (1989)). See also, Osaka et al., J. Pharm. Sci. 85(6):612-618 (1996); San et al., 
Human Gene Therapy 4:781-788 (1993); Senior et al., Biochemica et Biophysica Acta 1070:173- 
179 (1991); Kabanov and Kabanov, Bioconjugate Chem. 6:7-20 (1995); Remy et al., 
Bioconjugate Chem. 5:647-654 (1994); Behr, J-P., Bioconjugate Chem 5:382-389 (1994); Behr et 
5 al., Proc. Natl. Acad. Sci., USA 86:6982-6986 (1989); and Wyman et al., Biochem. 36:3008- 
3017 (1997). 

Cationic lipids are known to those of ordinary skill in the art. Representative cationic lipids include 
those disclosed, for example, in U.S. Pat. No. 5,283,185; and e.g., U.S. Pat. No. 5,767,099. In 
10 a preferred embodiment, the cationic lipid is N.sup.4 -spermine cholesteryl carbamate (GL-67) 
disclosed in U.S. Pat. No. 5,767,099. Additional preferred lipids include N4 _spermidine 
cholestryl carbamate (GL-53) and 1-(N4 -spermind) -2,3-dilaurylglycerol carbamate (GL-89). 

The vectors of the invention may be targeted to specific cells by linking a targeting molecule to the 
15 vector. A targeting molecule is any agent that is specific for a cell or tissue type of interest, 
including for example, a ligand, antibody, sugar, receptor, or other binding molecule. 

Invention vectors may be delivered to the target cells in a suitable composition, either alone, or 
complexed, as provided above, comprising the vector and a suitably acceptable carrier. The vector 

20 may be delivered to target cells by methods known in the art, for example, intravenous, 

intramuscular, intranasal, subcutaneous, intubation, lavage, and the like. The vectors may be 
delivered via in vivo or ex vivo applications. In vivo applications involve the direct administration 
of an adenoviral vector of the invention formulated into a composition to the cells of an individual. 
Ex vivo applications involve the transfer of the adenoviral vector directly to harvested autologous 

25 cells which are maintained in vitro, followed by readministration of the transduced cells to a 
recipient. 

In a specific embodiment, the vector is transfected into antigen-presenting cells. Suitable sources 
of antigen-presenting cells (APCs) include, but are not limited to, whole cells such as dendritic 
30 cells or macrophages; purified MHC class I molecule complexed to beta2-microglobulin and foster 
antigen-presenting cells. In a specific embodiment, the vectors of the present invention may be 
introduced into T cells or B cells using methods known in the art (see, for example, Tsokos and 
Nepom, 2000, J. Clin. Invest. 106:181-183). 

35 The invention described and claimed herein is not to be limited in scope by the specific 

embodiments herein disclosed, since these embodiments are intended as illustrations of several 



32 



aspects of the invention. Any equivalent embodiments are intended to be within the scope of this 
invention. Indeed, various modifications of the invention in addition to those shown and described 
herein will become apparent to those skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of the appended claims. 

Various references are cited herein, the disclosures of which are incorporated by reference in their 
entireties. 
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